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1 Introduction 



This paper is about the following type of problem: given independent (not necessarily iden- 
tically distributed) random variables Xi, X2, . . . , Xj^, find the 'size' of \S\, where 

N 
n=l 

We will examine several ways to measure this size. The first will be through tail distri- 
butions, that is, Prd-Sj > t). Finding an exact solution to this problem would be a dream 
of probabilists, so we have to temper our desires in some manner. In fact, this problem goes 
back to the foundations of probability in the following form: if the sequence (X„) consists 
of random variables that are mean zero, identically distributed and have finite variance, find 
the asymptotic value of Prd^"! > y/Nt) as N ^ 00. This is answered, of course, by the 
Central Limit Theorem, which tells us that the answer is the Gaussian distribution. There 
has been a tremendous amount of work on generalizing this. We refer the reader to almost 
any advanced work on probability. 

Our approach is different. Instead of seeking asymptotic solutions, we will look for 
approximate solutions. That is, we seek a function f{t), computed from such that 

there is a positive constant c with 

c-'f{ct)<FTi\S\>t)<cfic-H). 

The second measurement of the size of \S\ will be through the pth moments, \\S\\p. Again, 
we shall be searching for approximate solutions, that is, finding a quantity A such that there 
is a positive constant c so that 

c-'A < \\Sl < cA. 

While this may seem like quite a different problem, in fact, as we will show, there is a 
precise connection between the two, in that obtaining an approximate formula for \\S\\p with 
constants that arc uniform as p — > 00 is equivalent to obtaining an approximate formula for 
the tail distribution. 

The third way that we shall look at is to find the size of 1 5*1 in a rearrangement invariant 
space. This hue of research was began by Carothers and Dilworth (1988) who obtained 
results for Lorentz spaces, and was completed by Johnson and Schechtman (1989). Our 
results will give a comparison of the size of IS*! in the rearrangement invariant space with 
US' lip, obtaining a greater control on the sizes of the constants involved than the previous 
works. 

Many of the results of this paper will be true of all sums of independent random variables, 
even those that are vector valued, with the following proviso. Instead of considering the sum 
S — J2n-^n, we will consider the maximal function U = sup„ Efc=i^fc|- We will define a 
property for sequences called the Levy property, which will imply that U is comparable to S. 
Sequences with this Levy property will include positive random variables, symmetric random 
variables, and identically distributed random variables. The result of this paper that gives 
the tail distribution for S is only valid for real valued sequences of random variables that 
satisfy the Levy property. However the results connecting the Lp and the rearrangement 
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invariant norms to the tail distributions of U are valid for all sequences of vector valued 
independent random variables. 

Let us first give the historical context for these results, considering first the problem of 
approximate formulae for the tail distribution. Perhaps the earliest works are the Paley- 
Zygmund inequality (see for example Kahane (1968, Theorem 3, Chapter 2)), and Kol- 
mogorov's reverse maximal inequahty (see for example Shiryaev (1980, Chapter 4, section 
2.)) Both give (under an extra assumption) a lower bound on the probability that a sum 
of independent, mean zero random variables exceeds a fraction of its standard deviation 
and both may be regarded as a sort of converse to the Chebyshev's inequality. Next, in 
1929, Kolmogorov, proved a two-sided exponential inequality for sums of independent, mean- 
zero, uniformly bounded, random variables (see for example Stout (1974, Theorem 5.2.2) or 
Ledoux and Talagrand (1991, Lemma 8.1)). All of these results require some restriction on 
the nature of the sequence (X„), and on the size of the level t. 

Hahn and Klass (1997) obtained very good bounds on one sided tail probabilities for 
sums of independent, identically distributed, real valued random variables. Their result had 
no restrictions on the nature of the random variable, or on the size of the level t. In effect, 
their result worked by removing the very large parts of the random variables, and then using 
an exponential estimate on the rest. We will take a similar approach in this paper. 

Let us next look at the pth moments. Khintchine (1923) gave an inequality for Rademacher 
(Bernoulli) sums. This very important formula has found extensive applications in analysis 
and probability. Khintchine' s result was extended to any sequence of positive or mean zero 
random variables by the celebrated result of Rosenthal (1970). The order of the best con- 
stants as p — > cxD was obtained by Johnson, Schechtman and Zinn (1983), and Pinelis (1994) 
refined this still further. Now even more precise results arc known, and we refer the reader 
to Figiel, Hitczenko, Johnson, Schechtman and Zinn (1997). However, the problem with all 
these results is that the constants were not uniformly bounded as p — > oo. 

Khintchine's inequality was generalized independently by Montgomery and Odlyzko (1988) 
and Montgomery-Smith (1990). They were able to give approximate bounds on the tail prob- 
ability for Rademacher sums, with no restriction on the level t. Hitczenko (1993) obtained 
an approximate formula for the Lp norm of Rademacher sums, where the constants were 
uniformly bounded as p — > oo. (A more precise version of this last result was obtained in 
Hitczenko-Kwapieh (1994) and it was used to give a simple proof of the lower bound in 
Kolmogorov's exponential inequality.) 

Continuing in the direction of Montgomery and Odlyzko, Montgomery-Smith and Hitczenko, 
Gluskin and Kwapieh (1995) extended tail and moment estimates from Rademacher sums 
to weighted sums of random variables with logarithmically concave tails (that is, -Pd-'^l > 
t) = exp(— 0(t)), where (f) : [0, oc) [0, oo) is convex). After that, Hitczenko, Montgomery- 
Smith, and Oleszkiewicz (1997) treated the case of logarithmically convex tails (that is, the 
4> above is concave rather than convex). It should be emphasized that in the last paper, the 
result of Hahn and Klass (1997) played a critical role. 

The breakrough came with the paper of Latala (1997), who solved the problem of finding 
upper and lower bounds for general sums of positive or symmetric random variables, with 
uniform constants as p — oo. His method made beautiful use of special properties of the 
function t h- > t^. In a short note, Hitczenko and Montgomery-Smith (1999) showed how to 
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use Latala's result to derive upper and lower bounds on tail probabilities. Latala's result is 
the primary motivation for this paper. 

The main tool we will use is the Hoffmann- J0rgensen Inequality. In fact, we will use a 
stronger form of this inequality, due to Klass and Nowicki (1998). The principle in many 
of our proofs is the following idea. Given a sequence of random variables we choose 

an appropriate level s > 0. Each random variable X„ is split into the sum Xn~^^ + X^^\ 
where Xn-**-* = X„J|x„|<s, and Xi>^) = X„J| x„\>s- It turns out that the quantity (xi^*-*) 
can either be disregarded, or it can be considered as a sequence of disjoint random variables. 
(By "disjoint" we mean that the random variables are disjointly supported as functions on 
the underlying probability space.) As for the quantity ^„Xn~*'', it will turn out that the 
level s allows one to apply the Hoffmann- J0rgensen/Klass-Nowicki Inequality so that it may 
be compared with quantities that we understand rather better. 

Let us give an outline of this paper. In Section 0, we will give definitions. This will 
include the notion of decreasing rearrangement, that is, the inverse to the distribution func- 
tion. Many results of this paper will be written in terms of the decreasing rearrangement. 
Section ^ is devoted to the Klass-Nowicki Inequality. Since our result is slightly stronger 
than that currently in the literature, we will include a full proof. In Section ^, we will in- 
troduce and discuss the Levy property. This will include a "reduced comparison principle" 
for sequences with this property. Section ^ contains the formula for the tail distribution of 
sums of real valued random variables. Then in Section |^, we demonstrate the connection 
between Lp-norms of such sums and their tail distributions. In Section |^ we will discuss 
sums of independent random variables in rearrangement invariant spaces. 

2 Notation and definitions 

Throughout this paper, a random variable will be a measurable function from a probability 
space to some Banach space (often the real line). The norm in the implicit Banach space 
will always be denoted by | ■ |. 

Suppose that / : [0, oo) [0, oo] is a decreasing function. Define the left continuous 
inverse to be 

f~\x-) = snp{y : f{y) > x}, 
and the right continuous inverse to be 

f-\x+) = snp{y : f{y) > x}. 

In describing the tail distribution of a random variable X, instead of considering the 
function 1 1— > Pr(|X| > t), we will consider its right continuous inverse, which we will denote 
by X*{t). In fact, this quantity appears very much in the literature, and is more commonly 
referred to as the decreasing rearrangement of |X|. Notice that if one considers X* to be a 
random variable on the probability space [0, 1], then X* has exactly the same law as |X|. We 
might also consider the left continuous inverse t ^— X*{t—). Notice that X*(t) < x < X*(t— ) 
if and only if Pr(|X| > x) < t < Pr(|X| > x). 

If A and B are two quantities (that may depend upon certain parameters), we will write 
A ^ B to mean that there exists positive constants Ci and C2 such that Ci^A < B < C2A. 
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We will call Ci and C2 the constants of approximation. If f{t) and g{t) are two (usually 
decreasing) functions on [0, 00), we will write f{t) ~ g{t) if there exist positive constants Ci, 

C2, C3 and C4 such that c^^f{c2t) < g{t) < Csf{c^H) for all t > 0. Again, we will call Ci, C2, 
C3 and C4 the constants of approximation. 

Suppose that X and Y arc random variables. Then the statement Pr(|X| > t) ^ Pr(|y | > 

t) is the same as the statement X*{t) f» Y*{t). Since X*{t) = for t > 1 the latter 

statement is equivalent to the existence of positive constants Ci, C2, C3, C4 and C5 such that 
c^^X*(c2t) < Y*{t) < CsX*{c^h) for t < 

To avoid bothersome convergence problems, we will always suppose that our sequence 
of independent random variables (Xn) is of finite length. Given a sequence of independent 
random variables when no confusion will arise, we will use the following notations. 

If is a finite subset of N, we will let Sa = S„gA^»^' = sup^g^ \Xn\. If A; is a 

positive integer, then Sk = S^i^,,,^k} and = M^i ... fc}. We will define the maximal function 
Uk — sup]^<„<;;. Furthermore, S — Sn, M — M^, and U — Um, where N is the length 
of the sequence {X^^. 

If s is a real number, we will write xi^*'' = XnI\Xn\>s and X^"^'' = X„/|x„|<s = Xn—X^^\ 
For A C N, we wiU write S^^^'' = XlneA Similarly we define S^^^\ Slr^\ etc. 

Another quantity that we shall care about is the decreasing rearrangement of the disjoint 
sum of random variables. This notion was used by Johnson, Maurey, Schechtman and 
Tzafriri (1979), Carothers and Dilworth (1988), and Johnson and Schechtman (1989), all in 
the context of sums of independent random variables. The disjoint sum of the sequence {Xn) 
is the measurable function on the measure space i7 x N that takes {uj, n) to X„(c<j). We shall 
denote the decreasing rearrangement of the disjoint sum by i : [0, 00) — >■ [0, 00], that is, l{t) 
is the least number such that 

Y,'PT{\Xn\>l{t))<t. 

n 

Define i{t) to be £{t) if < t < 1, and otherwise. Since £{t) is only non-zero when 
< t < 1, we will think of £ as being a random variable on the probability space [0, 1] with 
Lcbcsgue measure. The quantity £ is effectively M in disguise. This next result essentially 
appears in Johnson and Schechtman (1989). 

Proposition 2.1 IfO<t<l, then 

£{2t) < i{t/{l - t)) < M*{t) < £{t). 



Proof: The first inequahty follows easily once one notices that both sides of this inequahty 
are zero if i > 1/2. 

To get the second inequality, note that, by an easy argument, if cci, q;2, • • • > with 
Y.n^n<^, then 



i-E".^na-«J^i-r|f£t:- 

r) n * — ' ' t 
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So, if Pr(£ >x) = EnPr(l^n| > x) < 1, then 

Pr(M > a;) = 1 - - Pr(|X„| > x)), 

and hence 

^ , < Pr(M > x) < Pr(£ > x). 

1 + Pr(£ > x) - ^ ; - V ; 

Taking inverses, the resuh follows. □ 

3 The Klass-Nowicki Inequality 

This section is devoted to the following result — the Klass-Nowicki Inequality. 

Theorem 3.1 Let (Xn) be a sequence of independent random variables. Then for all positive 
integers K we have 

pr(u > m - m < ± p,(M > t). 

The original inequality of this form was for Rademacher (or Bernoulli) sums and K = 2, 
and was due to Kahane (1968). This was extended by Hoffmann- J0rgensen (1974) to general 
sums, at least for positive or symmetric random variables, for the case K = 2. Indeed, if one 



wants Theorem ^Tl| for K > 2, but without the K\ factor, this may be obtained by iterating 
the Hoffmann- J0rgensen Inequality, as was done by Johnson and Schechtman (1989, Lemmas 
6 and 7). (Both Kahane and Hoffmann- J0rgensen obtained slightly different constants than 
those we have presented. Also, in neither case did a factor like (1 — Pr(f/ > t)) appear in 
their formulae.) 



Klass and Nowicki (1998) were able to obtain Theorem |3.1| , at least in the case when the 
random variables are positive or symmetric. (However their constants are better than ours.) 
Removing the positive or symmetric condition is really not so hard, but because it does not 



appear in the literature in this manner, we will give a complete proof of Theorem p3 . 

We also note that this inequality has some comparison with a result that appears in 
Ledoux and Talagrand (1991, Theorem 6.17.) 

Proof: Let be the length of the sequence (X„). During this proof, let us write (m, n] for 
the set of integers greater than m and not greater than n. 
We start with the observation 

Fr{U > {3K - l)t) < Ft{U > {3K - l)t and M <t) + Pr(M > t). 

Now, if we have that both U > {3K — l)t and M < t, then we ensure the existence of an 
increasing sequence of non-negative integers mo, . . . , rrix, bounded by N, and defined as 
follows. Set tuq = 0. If we have picked mi_i, let m/ be the smallest positive integer greater 
than rrii^i such that \S(rni_i,'mi]\ > 2t. For / = 1, it is clear that such an integer exists. Let 
us explain why the integer rrii < N exists if 2 < I < K. 
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For 1 < I' < I — 1, and m/'_i < k < rrii' — 1, we have that \S(^rni,_-^,k]\ < 2i and jX^^, | < t. 
Hence 



and 



< (3/' - 



'S'(mj_i,mj-l] + ^mjj + S(^rni,_i,k] 

But we know that there exists a number m such that \Sm\ > (SX — l)t. Hence, we must 
have that m > m;_i, and that \S(^m^_^^rn]\ > (3i^ — 1)^ — 3(Z — l)t > 2t. 
Therefore 

Pr(C/ > {3K - l)t and M < < ^ Po,miPmi,m2 • • -PrnK-umK: 

l<mi<—<mK<N 

where 

Pm,n = Pr(|'S'(r„,fe]| < 2i for m < /c < n, and |5'(^,n]| > 2i). 
Now let us show the following inequality: 



E 

k=m+l 



Pm,k 



< 



1 - Fr{U > t) 



E 

fc=m+l 



Pk, 



where 



pn = Prd'S'fel <t ior 1 <k <n, and > t). 
Using independence, we have that 



Pm.fe = Pr( sup \Sk-Sm\>2t) 

k=m+l "*<'=^" 

= Pr( sup \Sk - Sm\ > 2t\Um < t) 

m<k<n 

< Pr( sup \Sk\ >t\Urr,< t) 
m<k<n 

Pr(sup^<jk<„|5fe| > t and 



< 



1 



1 -Pr(t/ > t) 



Pr{Um < t) 
n 

J2 



k=m+l 



as required. 
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Now we rearrange the sum as follows: 

^ ] Po,miPmi,m2 ' ' ' PmK-i,TnK 

l<m\<—<mK<N 

N 

^ ] Po,miPmi,m2 ' ' ' PmK-2,mK-i ^ ] PmK-i,mK 

l<mi<--<mK-i<N mK=fnK-i+^ 

N 



^ 1 — Prft/ > P0,miPmx,m2 ' ' ' PmK-2,mK-i P' 



l<mi<---<mK-i<-'V mif=mK-i+l 



Now we rearrange this last quantity to get 

^ ] P0,miPmi,m2 ' ' ' PmK-ijmK 

l<mi<—<mK<N 



1 

~ 1 — PrfL'^ > t) ^ P0,miPmi,m2 ' ' ' PmK-z,mK-2PmK ^ 

rriK 

^ ^ ] PmK-2,mK-i 

mK-i=mK-2+i 



1 

— fl — Prf[/ > / y P0,miPmi,m2 ' ' ' PmK-3,mK-2PmK ^ 



rriK 



X PrriK-i- 



mif_i=m/f-2+l 

Repeating this argument (/T — 2) more times, we eventually see that 



^ ] P0,m\Pm\,m2 ' ' ' PmK-i,mK 

l<m<---<m/f <JV 

1 . . . 

^ 1 - Pt(U > t))^ ^ PmiPm2 - ■ -PrriK- 



l<mi<—<mK<N 

Now, since K distinct numbers may be rearranged in K\ different ways, we have that 

^ ] PmiPm2 ' ' ' PrriK 

l<mi<—<mK<N 



1_ ^ 

jv^l / ^ PmiPm2 PrriK 



K\ 

l<mi,m2,---,mK^N 
mi,m2v !™K distinct 



1 ... 
— / J PmiPm2 ' ' 'PrriK 



K\ 

l<mi ,m2,...,TnK^N' 



K 



Since 

N 

Y,Pk = Pr{U>t), 

k=l 

we obtain the result. □ 
Let us now understand what this result means in terms of the decreasing rearrangement. 

Corollary 3.2 There exists a universal positive constant Ci such that for any sequence of 
independent random variables [Xn), and for < t < s < 1/2 we have 

U* it) < ci „ + ^* • 

max|log(l/s), loglog(4/t)| ^ ^ 

Proof: Notice that if f,g : [0, oo) — > [0, oo] are decreasing functions, then (max{/, (yf})~^ = 
max{/^^, ^f^^}, and ii f < g, then /^^ < g^^, where here f~^ denotes either the left or right 
continuous inverse of /. Since A + B < 2 max{yl, B} for any two positive numbers A and 
B, from Theorem we have that if Pr(f/ > t) < 1/2, then for all positive integers K 

Pr(f/ > {3K-l)t) < 2max|-^(2Pr(f/ > t))^,Pr(M > t)| . 
Taking inverses, we see that if (K\t/2Y^^ < 1/2, then 



3K-1 ' ' - 1 \ 2 V 2 y / ' \2 

Now, using the fact that max{y4, B} < A + B for any positive numbers A and B, and by 
choosing K to be the smallest integer such that s < {K\t/2Y^^ , and by some elementary 
but tedious algebra, the result follows. □ 



4 The Levy Property 

Let (Xn) be a sequence of independent random variables. We will say that (Xn) satisfies the 
Levy property with constants ci and C2 if whenever A C B (1 with A and B finite, then 

FT{\SA\>Cit)<C2FT{\SB\>t). 

The casual reader should beware that this property has nothing to do with Levy processes. 

The sequence (X„) has the strong Levy property with constants Ci and C2 if for all s > 
the sequence [Xn-^^) has the Levy property with constants ci and C2. 

Here are examples of sequences with the strong Levy property. (It may be easily seen 
that in all these cases that it is sufficient to show that they have the Levy property.) 

(i) All positive sequences, with constants 1 and 1. 
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(ii) All symmetric sequences (that is, when X„ has the same law as — X„) with constants 
1 and 2. This "reflection property" plays a major role in results attributed to Levy, 
hence the name of the property. 

(iii) Sequences of identically distributed random variables. This was shown independently 
by Montgomery-Smith (1993) with constants 10 and 3, and by Latala (1993) with 
constants 5 and 4, or 7 and 2. 

We see that sequences with the Levy property satisfy a maximal inequality. 

Proposition 4.1 Let (X„) be a sequence of independent random satisfying the Levy property 
with constants c\ and C2- Then 

Pr(t/ > 3cit) < 3c2Pr(|^| > t). 

Thus M*{t) < 6ciS*{t/3c2). 

Proof: The first statement is an immediate corollary of the following result known as 
Levy-Ottaviani inequality: 

Pr(t/iv > 3t) < 3 sup Pr(|5fe| > t). 

l<k<N 

(Billingsley (1995, Theorem 22.5, p. 288) attributes this result to Etemadi (1985) who proved 
it with constants 4 in both places, but the same proof gives constants 3; see, for example, 
Billingsley. However the first named author learned this result from Kwapieh in 1980.) 
The second statement follows from the first, since M < 2U. □ 

We end with a lemma that lists some elementary properties. Part |(T}] of the lemma might 
be thought of as a kind of reduced comparison principle. 

Lemma 4.2 Let (X„) be a sequence of random variables satisfying the strong Levy property. 

(i) There exist positive constants Ci and C2, depending only upon the Levy constants of 
{Xn), such that if s < 1/2 and < t < 1, then 

(ii) There exist positive constants Ci and C2, depending only upon the strong Levy con- 
stants of (X„), such that if r < s < 1/2, and zf < t < 1, then (^(^^^*(*)))*(t) < 
c,(5(<A^*M))*(c-it). 

(iii) IfO<s<t<l, then S*{t) < (S(^*^*(^)))*(t - s), and (5(^^^*(^)))*(t) < S*{t -s). In 
particular, S*{t) < {S'^^^'*^'/^^^y{t/2), and (5(^*'^*(*/2)))*(t) < S*{t/2). 

(iv) Fora,P > 0, we have that 

(^(<M*{t)))*(^) ^ (S(^''*("*)))*(/3t) 

where the constants of approximation depend only upon a, P and the strong Levy 
constants of (X„). 
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(v) We have that 

S*{t) ^ ^ 
where the constants of approximation depend only upon the strong Levy constants of 



Proof: Let us start with part |(T]]. For each set ACN, define the event 

Ea = {\Xn\ < M*{s) if and only if n G A}. 
Note that the whole probability space is the disjoint union of these events. Also 

{|5(<Arw)| > ^} n = {\Sa\ >x}n Ea. 
Furthermore, by independence, we see that 
Prd^Al > X and Ea) 

= Ft{\Sa\ > X and |X„| < M*{s) for n e A) Pr(|X„| > M*{s) for n ^ A). 

Hence 

Pr(|5{<M*{^))| >a;) 

= J2 ^^il^M > X and \Xn\ < M*{s) for n e A) Pr(|X„| > M*{s) for n ^ A) 

ACN 

< 2 ^ Prd^Al > x) Pr(|X„| < M*{s) for n E A) Pr(|X„| > M*(s) for n ^ A) 

ACN 

<C2Pr(|5| >crM, 
where here we have used the fact that 

Pr(|X„| < M*{s) for n e A) > Pr(M < M*{s)) >l-s> 1/2. 



Part [n]] follows by applying part |[T]| to S**--*^' 



'(r)) 



Part |(iii)| follows from the observation that 

Ft{S ^ < Pr(M > M*{s)) < s. 

Hence, if Pr(S > a) > t, then Pr(^(^*^'(«)) > a) > t - s, and conversely, if Pr(^(^*^'(^)) > 
a) > t then Pr(S' > a) >t - s. 

To show part |(iv)| , we may suppose without loss of generality that a = 1 and P > 1. 
Clearly S^-^'^'^^^^t) > S**--^^* (/3t), so we need only show an opposite inequality. From 
part |( ii )| , there are positive constants ci and C2, depending only upon the strong Levy con- 
stants of {Xn), such that for < t < 1/2 



where C3 = 02 P ■ 



Part |(v)| follows easily by combining part (iii), part (iv), and Proposition 2.1. □ 



11 



5 Tail distributions 



In this section, we will state and prove the formula for the tail distribution of the sum of 
independent, real valued, random variables that satisfy the Levy Property. 

If one restricts the formula to the case of sums of independent, identically distributed 
random variables, one obtains a formula very similar to the main result of Hahn and Klass 
(1997). The main differences are that their inequality involves one sided inequalities, and 
also that their inequality is more precise. 

This formula also has a strong resemblance to the result of Latala. As we shall show in 
Section computing the Lp norm of U is effectively equivalent to computing U*{e~^). Then 
if one notices that (1 + xY is very close to e^^ for small positive x, one can see that this 
result and the result of Latala are very closely related. Presumably one could derive Latala's 
result by combining Theorem p.l| with Theorem |6.1| . However the technical difficulties are 
quite tricky, and since Latala's proof is elegant, we will not carry out this program here. 

Theorem 5.1 Let he a sequence of real valued independent random variables satisfying 
the strong Levy property. Define the functions Fi{t) and F2{t) to be if t > 1, and if 
0<t<l, 



= inf I A > : JjE(t^"^''*"/^) < t"^ and JjE(t- 

I, n n 



^(<M*(t)) . -I— r „{<M*{i)) 

A > : JJ E(t^" < t~^ and JJ E(t-^" <t-^\. 

n n ) 

Then 

S^it) J F,it) J F,{t), 

where the constants of approximation depend only upon the strong Levy constants of 

Let us start with gaining some understanding of Orlicz spaces. There is a huge literature 
on Orlicz spaces, see for example Lindenstrauss and Tzafriri (1977). Suppose that $ : 
[0, oo) [0, oo] is a given function (usually convex with $(0) = 0). Then the Orlicz norm 
of a random variable X is defined according to the formula 

||X||^ = inf{A > : E<I)(|X|/A) < 1}. 

We will be concerned with the special functions 

l-^ — 1 
n ; t-i _ 1 

The following is a special case of results that appear in Montgomery-Smith (1992). 
Lemma 5.2 For any random variable X , and for t < 1/4, we have that 

WMU, ~ sup - — —rX a; , 
* o<x<i \og[xt) 



with constants of approximation bounded by 2. 
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Proof: Suppose first that < 1. Then E$t(X) < 1, which imphes that 

Jo 

that is, X*{x) < \og{xt)/log{t). 

Conversely, suppose that X*{x) < log(xt)/log(t) for < x < 1. Then 

*^ ^ ^-Jo \2\ogit)J t-i-1 - 



□ 



Proof of Theorem |5.1| : Let us start with the proof that S*{t) ^ Fi{t). Since the random 
variables Xr^^^*^^ are independent, we have that 

F,{t) = inf {a > : E(t^'''*'»/A) < ^-i ^^^^s^-^m./y^ ^ _ 

Now we notice that for any random variable Y, and < t < 1, we have that 

iE(rl^l) < max{E(t^),E(r^)} < E(rl^l). 



Hence 
and 



F,{t) < inf {a > : E(rl^'-*™/^l) < r'} = ||^^ 
F^it) > inf {a > : E(t-l^'-''*»/A|) < 2^1} = \\S^ 



<m)\\ 



<^W)|| 



where = ^- However, we quickly see that for x > that if t < 1/2 then 

"^tix) > ^$t(a;) > $t(a;/3), since $t is a convex function. Hence 

with constants of approximation bounded by 3. 
Next, we apply Lemma p.2|, and we see that 



F^{t) - sup ^AS^^^^m^). 

Taking a; = t, we see that the right hand side is bounded below by |(S''--^^*''^)*(t). Also, if 
t < X < 1, then 

log(a;t) 

Further, by Corollary |3]^, there exist constants Ci and C2, depending only on the Levy 
constants of (X„), such that if < x < t < cf^, then 

< c.j^((^(^^(*)))*(t) + (MW*)))*(x/2)) 

< C2((5(^'(*)))*(t)+£(t)). 
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Now, applying Proposition 2A_, Proposition O, and Lemma O part [v], we finally obtain 
the desired result. 

To show that S*{t) ~ -^2(^) is an almost identical proof. □ 



6 Lp norms 

The main result of this section establishes the relationship between the Lp norm of sums of 
random variables and their tail distributions. 

Theorem 6.1 Given Po > 0, z/p > po, and is a sequence of independent random 

variables, then 

Wl ^ t/*(e-74) + Ul ^ (f/(^^(-^/«»)*(e-74) + 
where the constants of approximation depend only upon Pq. 

We should note that we are not able to get universal control over the constants as ^ 0, 
as is shown by simple examples once one understands that \\Y\\p converges to the geometric 
mean of |y| as p ^ 0. 

Combining this with Corollary p.2| , we immediately obtain the following result that com- 
pares \\S\\^ to ll^llp. This result extends results of Talagrand, (see Ledoux and Talagrand 
(1991, Theorem 6.20), Kwapieh and Woyczyhski (1992, Proposition 1.4.2 and comments 
following it; see also Hitczenko (1994, Proposition 4.1)) and Johnson, Schechtman and Zinn 
(1983). If this result is specialized to symmetric or positive real valued random variables, 
then by considering the cases p = 2 or p = 1, it implies the inequality of Rosenthal (1970), 
including the result of Johnson, Schechtman and Zinn (1983) that gives correct order of the 
constants as p oo. 

Theorem 6.2 Let (X„) be a sequence of independent random variables and letpo > 0. Then 
there exist positive constants C\, C2 and c^, depending only upon po, such that for q > p > Po 
we have 

\\U\\, < c, , / , ,A \\U\\p + M*{c^'e-''))+c,\\M\\^ 

^ maxjp, log(e + g)| ^ ^ ^ 

< C3 r-r^-7 ^{\\UL+\\ML). 

- max{p,log(e + g)} ^" " "^^ 

Let us proceed with the proofs. First we need a lemma that allows us to deal with the 
"large" parts of U, so that they might be effectively considered as a sum of disjoint random 
variables. 

Lemma 6.3 Let (X„) be a sequence of independent random variables, and let < r < 1. 
Then we may express f/(>^(^')) = ^2^=1 where the random variables Vk are disjoint, and 
VCit) < M{t{k-l)\/r^-^). 
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Proof: In proving this result, we may suppose without loss of generality that X„ = X^^^^^\ 
that is, we may suppose that Pr(-^n 7^ 0) < r. 
If A is a finite subset of N, define the event 

Ea = {Xn 7^ if and only if n G A}. 

For each positive integer k, let Ek = IJ^cn Ea- Set Vk = UIe^- Notice that if \A\ = k, then 

\A\=k 

Pr{UlE^ > x) < ^Pr(|X„| > x/k and Ea) 

neA 

= 5^Pr(|X„| >x/A;) J] Pr(X^ ^ 0). 

nGA m,eA\{n} 

Hence, 

Pr(\4>x) = ^Pr([//s^> 

ACN 
|yl|=A: 

k 



X) 



< J2^r{\X,,\>x/k)l[FT{X,^^0) 

ii<---<ik j=l 1=1 

/ \ / \ k-1 

k 



k\ 



5^Pr(|X„|>xA) 5^Pr(X„^0) 



n 



□ 



Corollary 6.4 Let he a sequence of independent random variables, let Q < r < 1, and 
let < p < 00 . Then 

||f/(>^M)|| <2e2"^/P| 



\p — II lip 



Proof: Apply Lemma |6.3| to obtain the Vk- Using the fact that A; < 2 , we obtain that 



Thus 



00 00 /_p \A;— 1 

k=l k=l ^ '' 



2Pni£||P. 



□ 
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Proof of Theorem 6.1 : Applying Proposition 2A., we see that 

\\Ul>l\\Ml>2-'-'/^il. 

Also, we have that 

-1 



\\U\\l= / iU*{t)ydt>8-'e-P{U*{e-P/8)y, 
Jo 

that is, \\U\\p > 8-^^Pe-^U*{e-P/8) > 8-^/Pe-^U*{e-P /A). Hence we have shown that there 
exists a constant ci, depending only upon po, such that 

llf/|l,>cr^(f/*(e-74) + ii^ig. 

Furthermore, by Proposition |2.1| , 

Pr(f/ ^ f/(<^(^-V8))) < pr(^ > i{e-P/8)) < Pr(M > M*{e-''/8)) < 6-^/8. 

Hence U*{e^P/8) > ([/(-^("^ ''^^^^)*(e^P /A), and so we have shown that there is a constant 
C2 > 0, depending only upon Pq, such that 

Now let us derive the converse inequalities. Corollary p.2| tells us that for t < e~^/2 that 



Thus 



||f/{<ne-/8))||j; < r '^'((f/(<^(--V8)))*(^))P^^ + (l_e-p/2)(f/(<%-V8)))*(e-P/2) 

+ (f/{<ne-/8)))*(g-p/2) 

< c|((f/(^^(^"''/«)))*(e-72) +£(e-78))^ 
where C3 > depends only upon pq. Furthermore, 



{e-P/8f < 8eP / i{tY dt = 8e 



oPWfWP 



Hence, applying Corollary and the (quasi-)triangle inequality for Lp, we deduce that 
there exists a constant C4, depending only upon pq, such that 

ril, < C4(^(^'(^"^/^«)*(e-72) + IKig. 
Finally the result follows by noticing that 

(f/(<^(e-/8)))*(g-p/2) < (f/(<^('=-V8)))*(g-p/4)^ 

and also, by an argument similar to one presented above, that 

(^(<ne-V8)))*(g-P/2) < U*{3e-P/8) < U*{e-P/A). 

□ 
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7 Rearrangement invariant spaces 



Rearrangement invariant spaces are studied in much of the hterature, see for example Lin- 
denstrauss and Tzafriri (1977). However, we will work with a definition that is a little less 
restrictive. A rearrangement invariant space on the random variables is a quasi-normed Ba- 
nach space C of random variables such that 1 E C, and if X* < Y* and Y & C, then X G £ 
and ll^ll^ < ll^ll^- Obviously the spaces Lp for < p < oo are rearrangement invariant 
spaces. 

Given a rearrangement invariant space C, we define the quasi- constant of C to be the 
least constant K >0 such that ||X + Y\\^< K{\\X\\^ + \\y\\c) ^11 X,Y e C. Notice that 
if X*{2t) < Y*{t), and Y E C, then X may be written as the sum of two disjoint random 
variables Yi and Y2 with Y*{t),Y*{t) < Y*{t), and hence ||X||^ < 2K\\Y\\^. 

Given two rearrangement invariant spaces C and Ai, we will say that C embeds into Ai 
if there is a positive constant c such that if X E C, then X E Ai and < c||X||^. We 

will call the least such c the embedding constant of C into A^. 

Theorem 7.1 Let pq > 0, and let C be a rearrangement invariant space such that C embeds 
into Lp, and Lg embeds into C, where q > p > Po- Then there is a positive constant c, 
depending only upon the quasi-constant of C, the embedding constants, po and q/p, such that 
for any sequence of independent random variables 

c-\\\u\\p+mc)<\\u\\,<c{\\ui+mc)- 

Proof: Let us first obtain the left hand side inequality. It follows by hypothesis that 
llf^ll^ > C]^^l|f^llp) where Ci is the embedding constant of C into Lp. Furthermore, U > |M, 
and by Proposition 2.1, i{t) < M*{2t). Hence \\U\\^ > {AK)-'^\\l\\^, where K is the quasi- 
constant of C. 

Now let us obtain the right hand inequality. By Corollary |3.2| , we have that there is a 
universal positive C2 for < t < 1 

p 

Now f/*(t)/o<f<2-29/P > f/*(229/Pt), and hence 

||f/||^ < (2i^)r2'?/Plc2ir^(||t ^ U*{tP'^'i)\\^ + K\\M\\^). 

To finish the proof, suppose that \\U\\^ = A. Then it is easily seen that U*{t) < \t~^/'P. 
Thus, if C3 is the embedding constant of L„ into £, then 



|t ^ f/*(tP/29)||^ < c^\\t^U*{tP'^'')\\^ 

= €3(1 {u*{t'p/^^)ydt 



1/9 



»i \ i/q 



= 2i/^C3A. 



□ 
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